The presence of highly similar notes within the MIMIC-III dataset

نویسندگان

  • Rodney A. Gabriel
  • Sanjeev Shenoy
  • Tsung-Ting Kuo
  • Julian McAuley
  • Chun-Nan Hsu
چکیده

one is compiling statistics or training predictive algorithms that model the language or attributes in notes. We developed an algorithm to identify and characterize highly similar notes within the Multiparameter Intelligent Monitoring in Intensive Care (MIMIC-III) dataset. We found that there were multiple instances of exact copies, common outputs, and template notes form the public domain MIMIC-III dataset. Abstract

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Epoxidation of Olefins with Tetra n-Butyl-Ammonium Periodate in the presence of tetrakis (4-Substituted Phenyl) Porphyrinatiomanganese(III) Acetates and Imidazole

The epoxidation of olefins with tetra n-butylammonium periodate, n-Bu4/NIO4, is catalyzed by six different tetrakis (4-substituted phenyl) porphyrinatomanganese(III) acetate, Mn(T4-XPP)OAc, Complexes in the presence of imidazole as an axial ligand with low to high yields and complete selectivity at room temperature. While the electronic effects of the highly electron-w...

متن کامل

Synthesis and Investigation the Catalytic Behavior of Cr2O3 Nanoparticles

The use of an inorganic phase in water-in-oil (w/o) microemulsion has recently received considerable attention for preparing metal oxide nanoparticles. This is a technique, which allows preparation of ultrafine metal oxide nanoparticles within the size range 40 to 80 nm. Preparation of nano chromium (III) oxide studied investigated in the inverse microemulsion system. Therefore the nucleation o...

متن کامل

بررسی هم بستگی و تکرارپذیری آماره های پارامتری و چندمتغیره پایداری عملکرد دانه در جو دیم

Multi-environment trial data are required to obtain stability performance parameters as selection tools for effective cultivar evaluation. The interrelationship among several stability parameters and their associations with mean yield, along with the repeatability of these parameters in consecutive years was the objective of this study. Barley yield data of 18 cultivars, proprietary of Dryland ...

متن کامل

MINING FUZZY TEMPORAL ITEMSETS WITHIN VARIOUS TIME INTERVALS IN QUANTITATIVE DATASETS

This research aims at proposing a new method for discovering frequent temporal itemsets in continuous subsets of a dataset with quantitative transactions. It is important to note that although these temporal itemsets may have relatively high textit{support} or occurrence within particular time intervals, they do not necessarily get similar textit{support} across the whole dataset, which makes i...

متن کامل

بررسی هم بستگی و تکرارپذیری آماره های پارامتری و چندمتغیره پایداری عملکرد دانه در جو دیم

Multi-environment trial data are required to obtain stability performance parameters as selection tools for effective cultivar evaluation. The interrelationship among several stability parameters and their associations with mean yield, along with the repeatability of these parameters in consecutive years was the objective of this study. Barley yield data of 18 cultivars, proprietary of Dryland ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017